Advances in Very Low Bit Rate Speech Coding Using Recognition and Synthesis Techniques

Authors

  • Geneviève Baudoin
  • François Capman
  • Jan Cernocký
  • Fadi El Chami
  • Maurice Charbit
  • Gérard Chollet
  • Dijana Petrovska-Delacrétaz
Abstract

ALISP (Automatic Language Independent Speech Processing) units are an alternative to phoneme-derived units in speech processing. This article describes advances in very low bit rate coding using ALISP units. Results of speaker-independent experiments are reported, and speaker clustering using vector quantization is proposed. Improvements to speech re-synthesis using the Harmonic Noise Model and dynamic selection of units are discussed.
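To make the idea of very low bit rate coding with recognized units concrete, the following minimal sketch estimates the bit rate needed to transmit only the unit indices, assuming one fixed-length index per recognized segment. The function name `index_bit_rate` and the example values (64 units, 15 units per second) are hypothetical and are not taken from the article; prosodic information such as pitch, energy, and duration would add to this figure.

```python
def index_bit_rate(num_units: int, units_per_second: float) -> float:
    """Bits per second needed to send one fixed-length unit index per segment.

    Assumption (not from the article): every unit index is coded with the
    same number of bits, and nothing else is transmitted.
    """
    if num_units < 2:
        raise ValueError("num_units must be at least 2")
    if units_per_second <= 0:
        raise ValueError("units_per_second must be positive")
    # Smallest number of bits that can distinguish num_units different indices.
    bits_per_index = (num_units - 1).bit_length()
    return bits_per_index * units_per_second


# Hypothetical inventory of 64 units, spoken at 15 units per second.
print(index_bit_rate(64, 15.0))  # 90.0
```

Under these assumed values the index stream alone costs 6 bits x 15 units/s = 90 bits/s, which illustrates why a coder based on recognition and synthesis can operate at a few hundred bits per second.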

Similar resources

Dynamic Unit Selection for Very Low Bit Rate Coding at 500 bits/sec

This paper presents a new unit selection process for very low bit rate speech encoding at around 500 bits/sec. The encoding is based on speech recognition and speech synthesis technologies. The aim of this approach is to make the best use of the speaker's speech corpus. The proposed solution uses HMM modelling for the recognition of elementary speech units. The HMMs are first trained in an unsupervised...

A very low bit rate speech coder using HMM-based speech recognition/synthesis techniques

This paper presents a very low bit rate speech coder based on HMM (Hidden Markov Model). The encoder carries out phoneme recognition, and transmits phoneme indexes, state durations and pitch information to the decoder. In the decoder, phoneme HMMs are concatenated according to the phoneme indexes, and a sequence of mel-cepstral coefficient vectors is generated from the concatenated HMM by using...

Progress Report of a Project in Very Low Bit-rate Speech Coding

Background work in various levels of speech coding is reviewed, including unconstrained coding and recognition-synthesis approaches that assume the signal is speech. A pilot project in HMM-TTS based speech coding is then described, in which a comparison with harmonic plus noise modelling is also done. Results of the demonstration project including samples of speech under various transmission si...

Speech compression a novel method pdf

Text summarization is a process that reduces the size of a text document. For this purpose, we use part-of-speech tagging to recognize the types of words in the text. Compression rate is a scale to decrease the size of the text summary. A higher. This paper illustrates a novel method of speech compression and transmission. This method saves the transmiss...

Segmental vocoder-going beyond the phonetic approach

This paper addresses the problem of very low bit rate segmental speech coding. The basic units are found automatically in the training database using temporal decomposition, vector quantization and multigrams. They are modelled by HMMs. The coding is based on recognition and synthesis. In single-speaker tests, we obtained intelligible and natural-sounding speech at a mean rate of 211.2 b/...

Publication date: 2002